Brainstormers 2 D — Team Description 2009

نویسنده

  • M. Riedmiller
چکیده

The main focus of the Brainstormers’ effort in the RoboCup soccer simulation 2D domain is to develop and to apply machine learning techniques in complex domains. In particular, we are interested in applying reinforcement learning methods, where the training signal is only given in terms of success or failure. Our final goal is a learning system, where we only plug in “win the match” – and our agents learn to generate the appropriate behavior. Unfortunately, even from very optimistic complexity estimations it becomes obvious, that in the soccer simulation domain, both conventional solution methods and also advanced today’s reinforcement learning techniques come to their limit – there are more than (108×50) different states and more than (1000) different policies per agent per half time. This paper outlines the architecture of the Brainstormers team, focuses on the use of reinforcement learning to learn various elements of our agents’ behavior, and highlights other advanced artificial intelligence methods we are employing.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Brainstormers 2 D – Team Description 2005

The main interest behind the Brainstormers’ effort in the RoboCup soccer domain is to develop and to apply machine learning techniques in complex domains. In particular, we are interested in applying Reinforcement Learning methods, where the training signal is only given in terms of success or failure. Our final goal is a learning system, where we only plug in ’win the match’ – and our agents l...

متن کامل

Brainstormers 2003 - Team Description

The main interest behind the Brainstormers’ effort in the robocup soccer domain is to develop and to apply machine learning techniques in complex domains. Especially, we are interested in reinforcement learning methods, where the training signal is only given in terms of success or failure. Our final goal is a learning system, where we only plug in ’win the match’ and our agents learn to genera...

متن کامل

Description and Analysis of Performance, Potential and Efficiency of Iranian football pro- league by Data Envelopment Analysis

In the case of football it could be argued that the purpose of teams is to win the competitions in which they participated. However, the assessment of  football teams from the efficiency aspect would be relevant in judging whether the results have been obtained without waste. The purpose of this research is to compare  and  analysis of league ranking with  Potential and efficiency ranking. The ...

متن کامل

Playing Soccer with RoboSapien

Due to limited availability of humanoid robots and the high costs involved, multi-agent experiments with humanoid robots have been at least difficult so far. With the introduction of RoboSapien, a low-cost humanoid robot developed for the toy market, this situation has changed. This paper describes how we augmented multiple RoboSapiens to obtain a team of soccer playing humanoid robots. We adde...

متن کامل

WrightEagle2009 2D Soccer Simulation Team Description Paper

In WrightEagle2009, we continue to research based on the previous WrightEagle 2D soccer simulation team. WrightEagle has won the runner-up of RoboCup 2008 and Robocup 2007, the Champion of RoboCup 2006, and the runner-up of RoboCup 2005. In this paper, we mainly present the team structure of our new team WE2009, and the new techniques since the last competitions.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010